Reconnaissance Blind Chess
Appendix for Deep Synoptic Monte-Carlo Planning in Reconnaissance Blind Chess
The synopsis features were hand-designed; many of them are natural given the rules of chess. The accompanying tables include saliency estimates averaged over five runs. However, it performed very poorly in that competition, winning fewer games than the random bot. The program and underlying algorithm presented in this paper are largely the same as the originals.
Deep Synoptic Monte-Carlo Planning in Reconnaissance Blind Chess
DSMCP is the basis of the program Penumbra, which won the official 2020 reconnaissance blind chess competition versus 33 other programs. This paper also evaluates algorithm variants that incorporate caution, paranoia, and a novel bandit algorithm. Furthermore, it audits the synopsis features used in Penumbra with per-bit saliency statistics.
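The abstract does not say how the per-bit saliency statistics are computed. As a rough illustration only, the sketch below uses gradient-based saliency averaged over a batch of positions; it assumes a PyTorch value network over binary synopsis vectors, and `model`, the bit layout, and the batching are all hypothetical stand-ins rather than Penumbra's actual code.

    import torch

    def per_bit_saliency(model, synopses):
        """Mean |d(value)/d(bit)| over a batch of synopsis vectors.

        `model` is assumed to map a float tensor of synopsis bits to a
        scalar value head; both the model and the bit layout are
        hypothetical, not Penumbra's actual interfaces.
        """
        x = synopses.float().requires_grad_(True)  # (batch, n_bits) in {0, 1}
        model(x).sum().backward()                  # one backward pass covers the batch
        return x.grad.abs().mean(dim=0)            # (n_bits,) saliency per feature bit

Averaging such statistics over several independent runs (the appendix above mentions five) would smooth out training noise.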
Neural Network-based Information Set Weighting for Playing Reconnaissance Blind Chess
Bertram, Timo, Fürnkranz, Johannes, Müller, Martin
In imperfect information games, the game state is generally not fully observable to players. Therefore, good gameplay requires policies that deal with the different information that is hidden from each player. To handle this, effective algorithms often reason about information sets: the sets of all possible game states that are consistent with a player's observations. While there is no way to distinguish between the states within an information set, this does not imply that all states are equally likely to occur in play. We extend previous research on assigning weights to the states in an information set in order to facilitate better gameplay in the imperfect information game of Reconnaissance Blind Chess. For this, we train two different neural networks which estimate the likelihood of each state in an information set from historical game data. Experimentally, we find that a Siamese neural network is able to achieve higher accuracy and is more efficient than a classical convolutional neural network for the given domain. Finally, we evaluate an RBC-playing agent that is based on the generated weightings and compare different parameter settings that influence how strongly it should rely on them. The resulting best player is ranked 5th on the public leaderboard.
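To make the Siamese idea concrete: two encoders embed the observation history and each candidate state into a shared space, and similarity scores are normalized into weights over the information set. The architecture, flat input encodings, and sizes below are illustrative assumptions, not the paper's exact model.

    import torch.nn as nn
    import torch.nn.functional as F

    class SiameseWeighter(nn.Module):
        """Weights candidate states in an information set against the
        player's observations. The 832-dim flat encodings (8x8 board x
        13 planes) and layer sizes are assumptions for illustration."""

        def __init__(self, obs_dim=832, state_dim=832, embed=128):
            super().__init__()
            self.obs_enc = nn.Sequential(nn.Linear(obs_dim, 256), nn.ReLU(),
                                         nn.Linear(256, embed))
            self.state_enc = nn.Sequential(nn.Linear(state_dim, 256), nn.ReLU(),
                                           nn.Linear(256, embed))

        def forward(self, obs, states):
            # obs: (obs_dim,); states: (n_states, state_dim)
            o = F.normalize(self.obs_enc(obs), dim=-1)      # shared embedding space
            s = F.normalize(self.state_enc(states), dim=-1)
            return F.softmax(s @ o, dim=0)                  # weights summing to 1

A convolutional alternative, as compared in the paper, would swap the linear encoders for convolutions over the board planes.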
Efficiently Training Neural Networks for Imperfect Information Games by Sampling Information Sets
Bertram, Timo, Fürnkranz, Johannes, Müller, Martin
In imperfect information games, the evaluation of a game state not only depends on the observable world but also relies on hidden parts of the environment. As accessing the obstructed information trivialises state evaluations, one approach to tackle such problems is to estimate the value of the imperfect state as a combination of all states in the information set, i.e., all possible states that are consistent with the current imperfect information. In this work, the goal is to learn a function that maps from the imperfect game information state to its expected value. However, constructing a perfect training set, i.e., an enumeration of the whole information set for numerous imperfect states, is often infeasible. To compute the expected values for an imperfect information game like Reconnaissance Blind Chess, one would need to evaluate thousands of chess positions just to obtain the training target for a single state. Still, the expected value of a state can already be approximated with appropriate accuracy from a much smaller set of evaluations. Thus, in this paper, we empirically investigate how a budget of perfect information game evaluations should be distributed among training samples to maximise the return. Our results show that sampling a small number of states, in our experiments roughly 3, for a larger number of separate positions is preferable over sampling a larger quantity of states for fewer positions. Thus, we find that in our case, the quantity of different samples seems to be more important than higher target quality.
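A minimal sketch of the budget trade-off studied above, assuming information sets given as lists of states and a perfect-information value function `evaluate` (both hypothetical interfaces): a budget of B evaluations spent k at a time yields targets for about B/k positions.

    import numpy as np

    rng = np.random.default_rng(0)

    def training_targets(info_sets, evaluate, budget, k):
        """Spend `budget` perfect-information evaluations k at a time:
        each of roughly budget // k positions gets, as its training
        target, the mean evaluation of k states sampled from its
        information set. `evaluate` and the info-set representation
        are assumed interfaces, not the paper's code."""
        targets = []
        for states in info_sets[: budget // k]:
            picks = rng.choice(len(states), size=min(k, len(states)), replace=False)
            targets.append(np.mean([evaluate(states[i]) for i in picks]))
        return targets

Under the paper's finding, a small k (roughly 3) with correspondingly more positions beats a large k with fewer positions.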
Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess
Bertram, Timo, Fürnkranz, Johannes, Müller, Martin
In this work, we adapt a training approach inspired by the original AlphaGo system to play the imperfect information game of Reconnaissance Blind Chess. Using only the observations instead of a full description of the game state, we first train a supervised agent on publicly available game records. Next, we increase the performance of the agent through self-play with the on-policy reinforcement learning algorithm Proximal Policy Optimization. We do not use any search, to avoid problems caused by the partial observability of game states, and only use the policy network to generate moves when playing. With this approach, we achieve an Elo rating of 1330 on the RBC leaderboard, which places our agent at position 27 at the time of this writing. We see that self-play significantly improves performance and that the agent plays acceptably well without search and without making assumptions about the true game state.
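A minimal sketch of the search-free move selection described above, assuming a PyTorch policy network over a from-square/to-square action encoding (the 64x64 = 4096-way encoding and the `policy_net` interface are assumptions). In RBC a player always sees its own pieces, so a mask over its requestable moves is computable from observations alone.

    import torch
    import torch.nn.functional as F

    def pick_move(policy_net, observation, move_mask):
        """Choose a move from the policy head alone: no search, no belief
        over the true game state. `move_mask` is a bool tensor marking the
        player's requestable moves; encoding and interfaces are assumed."""
        logits = policy_net(observation)                # (4096,) move logits
        logits = logits.masked_fill(~move_mask, -1e9)   # suppress unavailable moves
        return torch.multinomial(F.softmax(logits, dim=-1), 1).item()

Sampling (rather than taking the argmax) matches on-policy training with PPO; at evaluation time the argmax is a common alternative.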
Towards Using Fully Observable Policies for POMDPs
Sulyok, András Attila, Karacs, Kristóf
The Partially Observable Markov Decision Process (POMDP) is a framework applicable to many real-world problems. In this work, we propose an approach to solve POMDPs with multimodal beliefs by relying on a policy that solves the fully observable version. By defining a new mixture value function based on the value function from the fully observable variant, we can use the corresponding greedy policy to solve the POMDP itself. We develop the mathematical framework necessary for discussion, and introduce a benchmark built on the task of Reconnaissance Blind TicTacToe. On this benchmark, we show that our policy outperforms policies that ignore the existence of multiple modes.
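The abstract does not give the exact definition of the mixture value function; one standard, QMDP-style reading consistent with the description (belief-weighted use of the fully observable optimal value function) would be:

    \[
        V_{\mathrm{mix}}(b) \;=\; \sum_{s \in \mathcal{S}} b(s)\, V^{*}(s),
        \qquad
        \pi_{\mathrm{mix}}(b) \;=\; \operatorname*{arg\,max}_{a \in \mathcal{A}} \sum_{s \in \mathcal{S}} b(s)\, Q^{*}(s, a),
    \]

where $b$ is the (possibly multimodal) belief over states, $V^{*}$ and $Q^{*}(s,a) = R(s,a) + \gamma \sum_{s'} T(s' \mid s, a)\, V^{*}(s')$ come from the fully observable solution, and the greedy policy $\pi_{\mathrm{mix}}$ acts on the belief directly.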
On the Complexity of Reconnaissance Blind Chess
Markowitz, Jared, Gardner, Ryan W., Llorens, Ashley J.
This paper provides a complexity analysis for the game of reconnaissance blind chess (RBC), a recently-introduced variant of chess where each player does not know the positions of the opponent's pieces a priori but may reveal a subset of them through chosen, private sensing actions. In contrast to commonly studied imperfect information games like poker and Kriegspiel, an RBC player does not know what the opponent knows or has chosen to learn, exponentially expanding the size of the game's information sets (i.e., the number of possible game states that are consistent with what a player has observed). Effective RBC sensing and moving strategies must account for the uncertainty of both players, an essential element of many real-world decision-making problems. Here we evaluate RBC from a game-theoretic perspective, tracking the proliferation of information sets from the perspective of selected canonical bot players in tournament play. We show that, even for effective sensing strategies, the game sizes of RBC are comparable to those of Go, while the average size of a player's information set throughout an RBC game is much greater than that of a player in Heads-up Limit Hold 'em. We compare these measures of complexity among different playing algorithms and provide cursory assessments of the various sensing and moving strategies.
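To make the information-set bookkeeping concrete, a tracker keeps every board consistent with a player's observations, expands the set by all possible opponent moves each turn, and prunes it with each sense result. The python-chess sketch below ignores RBC details such as capture and move-result notifications, which a real tracker must also filter on; it only shows why set sizes explode and how sensing prunes them.

    import chess

    def expand_and_filter(info_set, sense_result):
        """Expand each candidate board by every opponent move (including
        passing), then keep boards consistent with the 3x3 sense result,
        given as {square: piece_or_None}. A simplified sketch, not a full
        RBC belief tracker."""
        successors = []
        for board in info_set:
            for move in list(board.pseudo_legal_moves) + [chess.Move.null()]:
                child = board.copy(stack=False)
                child.push(move)
                successors.append(child)
        return [b for b in successors
                if all(b.piece_at(sq) == piece for sq, piece in sense_result.items())]

Even with only a few dozen successor boards per candidate, the set grows geometrically between senses, which is exactly the proliferation the paper measures.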